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1 Pop quiz 

Recall the propositional satisfiability (SAT) problem: we are given a propositional formula such as 

(s Vt) A(^s) A(^'uVs V^t) A(^sVt) 

■ consisting of a conjunction of clauses, where a clause is a disjunction of literals; a literal is either a variable 

^ ' V {& positive literal) or the negation of a variable (a negative literal). We are to decide if there is an 

assignment to the variables satisfying the formula, that is, an assignment under which every clause contains 
^ ■ at least one true literal. The example formula is satisfied by the assignment / where /(s) = f{u) = false 

and f{t) = true. The SAT problem is famously regarded as the first natural problem to be identified as 
NP-complete. 

Two special cases of the SAT problem that are well-known to be decidable in polynomial time are the 
2-S AT problem, in which every clause is a 2-clause, a clause having exactly two literals, as in the formula 

{-^u y v) ^ {^u V -^v) A {^v yw) A {^w yt) a {^t v v) 

and the HORN-SAT problem, in which every clause is a Horn clause, a clause having at most one positive 
literal, as in the formula 

{^y V xi) A i^y' V ^xi V y) A (^X2 V ^y) A {^xi V X2). 
Now consider the Quantified SAT problem. We are given a quantified formula such as 

yv\ft3u3w{{^u V u) A (^n V -^v) A {^v V A {^w V A {^t V v)) 
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or 

VyVy'Vy"3xi33;2((^y V xi) A {^y' V -^xi V y) A (^2:2 V ^y) A {^y" V -^xi V 2:2)), 

that is, a formula consisting of a conjunction of clauses preceded by a quantifier prefix in which all of the 
variables are quantified. Our task is to decide if the formula is true or false. 

In general, Quantified SAT is known to be PSPACE-complete. However, we can very well consider 
the special cases Quantified 2-SAT and Quantified Horn-SAT, where the clauses are restricted to 
be 2-clauses and Horn clauses, respectively. These two special cases of Quantified SAT are known to 
be polynomial-time decidable 01431. Let us focus on 112 (or "V3") formulas, that is, quantified formulas 
where the quantifier prefix consists of a sequence of universally quantified variables followed by a sequence 
of existentially quantified variables (as in the above two examples). I claim that there is a simple proof 
that the 112 formulas of Quantified 2-SAT and Quantified Horn-SAT are polynomial-time ti-actable. 
More precisely, I claim that there are simple reductions from Quantified 2-SAT to 2-SAT and from 
Quantified Horn-SAT to Horn-SAT that can be justified by short proofs. At this point, I would like to 
kindly ask the reader to stop reading, and attempt to demonstrate this. 

Warning: spoilers ahead. I now present the claimed reductions and accompanying proofs. Let us begin 
with Quantified Horn-SAT. What do we want to do? We are given a 112 formula 

^> = Vyi . . . yym^xi . . . 3xn(l) 

where (p is the conjunction of Horn clauses, and we want to efficiently decide if <I> is true. Let denote the 
universally quantified variables of and let denote the existentially quantified variables of ^. Here, 
because the formula $ has prefix class 112, we may observe that it is true if and only if for every assignment 
/ : 1$ — > {true, false}, there exists an extension /' : 1$ UX$ — > {true, false} of / under which the clauses 
(j) is true. How might we check this property? 

Given a single assignment /:!$—;• {true, false}, we can certainly check efficiently whether or not it 
has an extension /' : Y$ U X$ — > {true, false} under which (j) is true: we simply instantiate the universally 
quantified variables 1$ according to /, and then use any polynomial-time algorithm for HORN-SAT to 
decide if the resulting HORN-SAT formula is satisfiable. However, if we are to act in polynomial time, we 
definitely do not have time to perform this check for all assignments to the universally quantified variables, 
as there are 2 1^*1 such assignments — too many! 

Interestingly enough, it turns out that it suffices to perform the "extension check" for a restricted set 
of assignments, in order to determine truth of the formula <I>. For an integer j > 1 and a constant b G 
{true, false}, let [< j,b]^ denote the set of all assignments f : {true, false} to the universally 

quantified variables of ^> such that the number of variables mapped to b is less than or equal to j. For 
example, [< l,false](i) contains the assignment sending all variables in 1$ to true, and all assignments 
on y$ in which exactly one variable is sent to false. I claim that $ is true as long as all assignments in 
[< 1, false] $ have a satisfying extension. 

Proposition 1.1. Let $ be an instance 0/ Quantified Horn-SAT having prefix class 112. The formula 
<I> is true if and only if for every assignment / G [< 1, false]<i>, there exists an extension X$ — > 

{true, false} of f satisfying all clauses of^. 

Clearly, this proposition yields the correctness of the following procedure for deciding a formula $ 
from our class: for every assignment / G [< l,false]$, instantiate the variables Y$ according to / and 
use a polynomial-time algorithm for HORN-SAT to check if the resulting clauses are satisfiable; if they 
are satisfiable for every assignment /, return "true", otherwise, return "false". Since — relative to the size 
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of $ — there are polynomially many (in fact, linearly many) assignments in [< l,false]$ this procedure is 
indeed a polynomial-time procedure. 

In order to prove the proposition, we will make use of the following concept. Say that a proposi- 
tional formula is preserved by an operation g : {true, false}'^ {true, false} if for any k assignments 
/i, . . . , /fc under which ijj is true, the formula ijj is also true under the assignment g{fi, ■ ■ ■ , fk) defined by 
{g{fi, fk)){v) = g{fi{v), fk{v)) for all variables v. 

Example 1.2. We can verify that the boolean AND operation A preserves any Horn clause. Let /i V . . . V /c 
be a Horn clause, where the li denote literals, and let Vi denote the variable underlying the literal /». Let 
/i , /2 be assignments under which the clause is true. If one of the assignments /i , /2 satisfies a negative 
literal li = -iWj, then (A(/i, /2))(fi) = false and the clause is true under A(/i,/2). Otherwise, every 
negative literal is false under both /i and /2, and since /i, /2 are satisfying assignments, there must be a 
positive literal Ij = vj with fi{vj) = f2{vj) = true. Then (A(/i, f2)){vj) = true and the clause is true 
under A(/i,/2). □ 

Example 1.3. Now let ^ = Ci A . . . A be a conjunction of clauses for which there is an operation 
g : {true, false}'^ — {true, false} preserving each clause Q. We can verify that (p itself is preserved by 
g. Indeed, let fi, ■ ■ ■ , fk be any assignments under which is true. Then, consider any clause Cj. The 
clause Ci is true under all of the assignments fi, ■ ■ ■ , fk- Since g preserves Cj we have that Ci is true under 
g{fi, . . . , fk), and since our choice of Ci was arbitrary we have that is true under g{fi, . . . , fk). □ 

From these two examples, we see that every conjunction of Horn clauses is preserved by the operation A. 
With this fact in hand, we may now turn to the proof of the proposition. 

Proof. (Proposition ll.il) The "only if" direction is clear, so we prove the "if" direction. Let m denote |Y$|. 
We prove by induction that the following holds for alH = 1, . . . , m: every assignment / G [< false]$ has 
an extension /' : Y$ U X$ {true, false} satisfying the clauses (j) of <I>. This suffices, since [< m, false]$ 
is the set of all assignments to 1$. 

The base case i = I holds by hypothesis. 

Suppose that i > 2. Let / be an assignment in [< ijfalse]^. If / € [< i — l,false]$ then the desired 
extension /' exists by induction. So suppose that / maps exactly i variables S = {si, . . . , Si} to the value 
false. Let fi : {true, false} be the assignment mapping exactly the variables S \ {si} to false, 

and let f2 '■ Y^ ^ {true, false} be the assignment mapping exactly the variables S \ {32} to false. Since 
/ij /2 £ i — l,false]$, they have extensions /{, /2 satisfying 0. Since A preserves cj), the assignment 
/' = A(/(, 72) also satisfies (p. The assignment /' is an extension of /: 

• /'(si) = A(/((si), false) = false 

• /'(S2) = A(false,/^(s2)) = false 

• for all s € \ {si, S2}, f'{s) = A(false, false) = false 

• for all y G y$ \ 5, f'{y) = A(true,true) = true 

□ 

Looking now at Quantified 2-SAT, we give a proof of tractability (again, for 112 formulas) that 
is similar in spirit to the proof we just gave for Quantified Horn-SAT. Whereas Horn clauses were 
preserved by the operation A, 2-clauses are preserved by a different operation. Let majority denote the 
ternary operation defined by majority(x, y, z) = {x A y) V {x A z) V [y A z). That is, majority returns the 
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input value occurring at least twice. It is straightforward to verify that majority preserves any 2-clause, 
Having observed this, we can now establish a result similar in spirit to Proposition ll.il 



Proposition 1.4. Let ^ be an instance o/ Quantified 2-S AT having prefix class 112. The formula $ is true 
if and only if for every assignment / € [< 2, false]$, there exists an extension /' : Y$ U {true, false} 

of f satisfying all clauses of ^. 

As with Proposition II. 1[ we can readily infer a polynomial-time algorithm for the quantified formulas 
under study from Proposition II. 41 Namely, using a polynomial-time algorithm for 2-S AT, it can be decided 
whether or not for all / € [< 2, false]$, the desired extension /' exists. Since there are polynomially many 
assignments in [< 2, false]$, this can be carried out in polynomial time. 

Proof. This proof is structurally identical to the proof of Proposition 11.11 As in that proof, the "only if" 
direction is clear, so we prove the "if" direction. Let m denote \ Y^\. We prove by induction that the following 
holds for alH = 2, . . . , m: every assignment / G [< i, false]$ has an extension /' : Y^UX^ — s- {true, false} 
satisfying the clauses (p of <I>. This suffices, since [< m, false]^ is the set of all assignments to 1$. 
The base case i = 2 holds by hypothesis. 

Suppose that i > 3. Let / be an assignment in [< i,false]$. If / € [< i — l,false]$ then the desired 
extension /' exists by induction. So suppose that / maps exactly i variables S = {si, . . . , Si} to the value 
false. For j e {1, 2, 3}, let fj : 1$ —>■ {true, false} be the assignment mapping exactly the variables 5\{sj} 
to false. Since the assignments /i, /2, /s are in [< i — 1, false] they have extensions /{, /2, satisfying 
(j). Since majority preserves any 2-clause, by the discussion in Example 1 1.3 1 it also preserves cj), and thus the 
assignment /' = majority(/(, /g, f^) also satisfies 0. The assignment /' is an extension of /: 

• f'{si) = majority(/((si), false, false) = false 

• f'{s2) = majority(false, /2(s2), false) = false 

• /'(ss) = majority(false, false, /3(s3)) = false 

• for all s G 5" \ {si, S2, s^}, f'{s) = majority(false, false, false) = false 

• for all y € y$ \ 5, /'(y) = majority(true, true, true) = true 

□ 

I mention that the polynomial-time tractability of the special cases of Quantified SAT that have been 
discussed, without the 112 restriction, is proved using the notion of preservation by an operation in f25l. 

2 What's this all about? 

What just happened? The question of how to efficiently decide the truth of certain logical formulas was 
posed, and answered by considering operations preserving the formulas. This situation exemplifies a theme 
underlying an emerging line of research that studies the complexity of constraint satisfaction problems. 

^ Here is a verification: let Zi V Z2 be a 2-clause, let /i , /2 , /a be assignments satisfying this clause, and let vi , V2 denote the vari- 
ables underlying the literals Zi, Z2, respectively. If two of the assignments /i, /2, fs are equal on {111,^2}, then majority(/i, /2, /a) 
is equal to those two assignments on {vi,V2} and hence satisfies the clause. Otherwise, /i, /2, and /a, restricted to {vi,V2}, are ex- 
actly the three distinct assignments satisfying hVh, and it is can be seen that both literals h, I2 are true under majority (/i, f2, f3)■ 
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More details on that, please. The constraint satisfaction problem (CSP) is a general framework in which 
many search problems can be readily modeled; an instance of the CSP consists of a set of constraints 
on variables, and the goal is to determine if there is an assignment to the variables satisfying every one 
of the given constraints. A broad family of problems can be obtained from the CSP by restricting the 
so-called constraint language — a set of relations that can be used to form constraints. Each constraint 
language T gives rise to a particular computational problem, denoted by CSP(r), and a focal research 
question is to describe the complexity of CSP(r) for all constraint languages T. The family of problems 
CSP(r) is extremely rich, and includes graph homomorphism problems, the problem of solving a system of 
equations over various algebraic structures, and the problems SAT, 2-SAT and HORN-SAT. The research 
area studying the problems CSP(r) has recognized that a set of operations — an algebra — can be associated 
to each constraint language F in such a way that information on the complexity of the problem CSP(r) can 
be derived from these operations. This association has given way to a fruitful interaction among the areas of 
logic, complexity, and algebra. 

What of the results presented in the previous section? Both Propositions II . 1 l and 1 1 .4l have been known. 
For example, they were proved by Gradel [37J to obtain results in descriptive complexity; Proposition ll.il 
was also derived by Karpinski et al. p3| from theory establishing the tractability of Quantified Horn- 
SAT. In those papers these propositions were derived by arguments strongly based on the clausal forms of 
the formulas under study, in contrast to the algebraic arguments given here based on operations. I hope the 
reader will agree that the proofs given here are particularly short and simple, and yield evidence that the 
utilized algebraic viewpoint can shed light even on classically studied fragments of propositional logic. I 
believe it is also worth emphasizing that the proofs of Propositions 11.11 and 11.41 are structurally identical, 
and thus that the reasoning employed in both cases is generic and not heavily tied to the particular formulas 
under study. 

What happens in this article? In this expository article, I give a contemporary, algebraic treatment of 
the inaugural result on constraint languages, Schaefer's theorem on boolean constraint satisfaction prob- 
lems [56|. This theorem classifies the complexity of CSP(F) for all constraint languages F over a two- 
element domain. In particular, it gives a description of the constraint languages F such that CSP(F) is 
polynomial-time tractable, and shows that for all other constraint languages F, the problem CSP(F) is NP- 
complete. This theorem is of broad interest, as it provides a rich class of NP-complete boolean satisfiability 
problems, some of which have extremely simple descriptions, and which — as Schaefer himself envisioned — 
often facilitate the development of a NP-hardness proof. Following an introduction to the algebraic view- 
point on constraint satisfaction (Section|3]), I give a relatively short but complete proof of Schaefer's theorem 
(Sections|4]and|5]l. After this, I prove — again using algebraic techniques — an analog of Schaefer's theorem 
for quantified satisfiability problems (Section O, and then give a "fine" classification theorem for quantified 
satisfiability problems where the number of quantifier alternations is bounded (Section|7]l. In the last section 
of this paper (Section [8]), I discuss a recently discovered application of the algebraic viewpoint to a class of 
logical formulas falling outside the framework of constraint satisfaction, and open the question of finding 
more results of this form. 

Almost all of the results presented here appear either explicitly or implicitly in the literature: the ex- 
position draws upon many previous publications, including the works of Post ll53l and Rosenberg ||55]| on 
clone theory; the papers of Geiger |[36l and Bodnarchuk et al. ifTTI identifying a relevant Galois connection; 
the paper of Schaefer [56|; the papers of Jeavons, Cohen, and Gyssens [42], Jeavons [40^, and Bulatov, 
Jeavons, and Krokhin [20] connecting constraint satisfaction to clone theory and universal algebra; and the 
tractability result of Jeavons, Cohen, and Cooper [41]. The novelty here is that I attempt to give a unified 
and relatively short account of these results, and include a number of lesser-known proofs. 
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Care has been taken to make the presentation of this article self-contained, and I do not assume any 
background in any of the areas touched by this article, other than familiarity with basic complexity-theoretic 
notions such as reducibility and the complexity classes P and NP. I hope that a wide variety of readers will 
be able to take something away from this article — from those who would like to understand (a nice version 
of) the statement of Schaefer's theorem and catch a glimpse of its inner workings, to those interested in 
incorporating the presented ideas and techniques into their research toolboxes. Indeed, I have endeavored 
to give a streamlined proof of Schaefer's theorem and the other classification theorems, and believe that the 
presented proof of Schaefer's theorem could reasonably be taught in a few lectures of a course, under the 
assumption of a mature audience. 

Throughout this article, I provide exercises for the reader; some are relatively routine, while others offer 
a taste of deeper ideas. To close this section, the following exercises expanding upon the discussion in 
Section[T]are offered. 

Exercise 2.1. Define the operation minority to be the the ternary operation such that minority(a;, y, z) = 
x(S)y®z, where © denotes the usual exclusive OR operation. That is, if all of the inputs to minority are the 
same value, that value is the output; otherwise, minority returns whichever one of its inputs occurs exactly 
once. Show that any equation of the form f i ® • • • © f fc = c where the Vi are variables and c € {0, 1} is a 
constant, is preserved by minority. 

Exercise 2.2. Using the previous exercise, prove a result analogous to Propositions li.il and \1.4\ for II2 
formulas where the quantifier-free part is the conjunction of equations of the form described in the previous 
exercise. 

Exercise 2.3. Show that Proposition \1.4\ holds with ([< 1, false]$ U [< 0, true]$) in place of [< 2, false]$. 
Observe that the set of assignments ([< 1, false]$ U [< 0, true]$) is in general smaller than [< 2, false]<j,.' 
Also observe that, by duality, this proposition holds with ([< l,true]<j) U [< 0,false](j>) or [< 2,true]<j> in 
place of [< 2, false]$. 

3 Constraint satisfaction and polymorphisms 

In this section, we describe the computational problems of interest and the algebraic tools that will be used 
to study them. 

Definition 3.1. A relation over domain D is a subset of for some k > 1; k is said to be the arity of the 
relation. A constraint language over domain D is a set of relations over D. A constraint language is finite if 
it contains finitely many relations, and is boolean if it is over the two-element domain {0, 1}. 

By a domain D, we simply mean a set. Other than in the last section, we will be concerned primarily 
with constraint languages over a finite domain D. Also, note that we will use and 1 to denote the boolean 
values false and true. 

Definition 3.2. A constraint over a constraint language T is an expression of the form R{vi, . . . , Vk) where 
R is a relation of arity k contained in T, and the Vi are variables. A constraint is satisfied by a mapping f 
defined on the Vi if{f{vi), . . . , /(ffc)) G R- 

The computational problems we are interested in are defined as follows. 

Definition 3.3. Let T be a finite constraint language over domain D. The problem CSP(r) is to decide, 
given a finite set of variables V and a finite set of constraints over T with variables from V, whether or not 
there exists a solution (or satisfying assignment), a mapping f : V ^ D satisfying all of the constraints. 
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Observe that, for all constraint languages F, the problem CSP(r) is in NP: a variable assignment / : 
y — > D has polynomial size, and whether or not it satisfies all constraints can be checked in polynomial 
time. We remark that when discussing problems of the form CSP(r), we will confine our attention to finite 
constraint languages F. This permits us to avoid certain technicalities and discussion of how relations are 
represented, although we should note that the complexity of infinite constraint languages is considered in 
the literature. 

Example 3.4. We demonstrate that 3-SAT, the case of the SAT problem where every clause has exactly 
three literals, can be viewed as a problem of the form CSP(r) for a boolean constraint language F. Define 
the relations i?o,3> ^i,3> ^2,3. and R^ ^ by 

Ro,3 = {0,1}3\ {(0,0,0)} 

Ri,3 = {o,i}3\ {(1,0,0)} 

R2,3 = {0,1}3\ {(1,1,0)} 
R3,3 = {0,1}3\ {(1,1,1)} 

Notice that for any variables x,y,z,we have the following equivalences: 

-^0,3(2;,^, 2;) = (xVyVz) 

-^1,3(2;,^, 2:) = i^xVyVz) 

R2,3{x,y,z) = i^xV^yyz) 

R3,3{x, y,z) = (^xV^yV^z) 

That is, (as an example) the constraint i?i_3(x, y, z) is satisfied by an assignment if and only if the clause 
(-ix V y V z) is satisfied by the assignment. 

Let F3 be the constraint language {Ro^3, Ri^3, R2,3, ^3,3}- Every instance of the 3-SAT problem can 
be readily translated into an instance of CSP(F3) having the same satisfying assignments. For instance, 
consider the 3-SAT instance 

(^u V s V ^t) A (^s V t V u) A (s V t V ^v) A (v V ti V s). 

It is equivalent to the CSP(F3) instance with variables {s, t, u, v} and constraints 

{R2,3{U, t, s), iii,3(s, t, v),Ri^3{v, S, t),Ro^3{v, U, s)}. 

Similarly, any instance of CSP(F3) can be formulated as an instance of 3-SAT. □ 

It is well-known that 3-SAT is NP-hard; we formulate this as follows. 

Proposition 3.5. The problem CSP(F3), where F3 as is defined in Example \3.4\ is NP-hard. 

Example 3.6. Schaefer ||56l identified the problem Not- all-equal satisfiability: given a collection 
of sets Si, ... , Sm each having at most 3 members, can the members be colored with two colors so that no 
set is all one color? We show that this problem is equivalent to CSP(F) where F contains the single relation 
i2NAE = {0,l}3\ {(0,0,0), (1,1,1)}. 

Take an instance Si,. . . ,Sm of NOT- ALL-EQUAL SATISFIABILITY; we translate it to an instance of 
CSP(F) by creating, for each set Si, a constraint R^hEis, t, u) where s, t, u are such that 5, = {s, t, u}. It 
is readily seen that a coloring / : (U™ ^^j) — > {0, 1} satisfies the condition given in the problem description 
if and only if it satisfies all of the constraints. Similarly, an instance of CSP(F) can be translated to an 
instance of NOT- ALL-EQUAL SATISFIABILITY by creating, for each constraint R^AEis, t, u), a set {s, t, u}. 
□ 
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We now give a notion of definability for relations. As we will see momentarily, this notion will permit a 
constraint language to "simulate" relations that might not be inside the constraint language. 

Definition 3.7. We say that a relation R C is pp-definable (short for primitive positive definable) /rom 
a constraint language T if for some m > there exists a finite conjunction C consisting of constraints and 
equalities (n = v) over variables {vi, . . . ,Vk, xi, . . . , Xm} such that 

R{vi,. . . ,Vk) = 3xi . . . 3xmC. 

That is, R contains exactly those tuples of the form {g{vi), . . . , g{vk)) where g is an assignment that can be 
extended to a satisfying assignment ofC. We use (T) to denote the set of all relations that are pp-definable 
from r. 

Example 3.8. Let S = {(0, 1), (1, 0)} be the disequality relation over {0,1}. The following is a pp- 
definition of S from the constraint language of Example |3.4[ 

S{y, z) = 3x(i?o,3(2;, y, z) A -Ri,3(2;, y, z) a R2,^{z, y, x) A i?3,3(^, y, x)). 

□ 

When all relations in a constraint language T' are pp-definable in another constraint language F, we have 
that the constraint satisfaction problem over V reduces to that over F. 

Proposition 3.9. (implicit in I140\I } Let F and T' be finite constraint languages. IfT'Q (F), then CSP(r') 
reduces to CSP(F). 

Note that the only notion of reduction used in this article is many-one polynomial-time reduction]^ 
Proof. From an instance of CSP(F'), we create an instance of CSP(F) in the following way. We loop over 
each constraint C = R{vi, . . . , Vk) in (/>, performing the following operations for each: let 3xi . . . 3xmC 
be a pp-definition of C over F, rename the existentially quantified variables xi, . . . , Xm if necessary so that 
they are distinct from all variables of other constraints, and replace C with the constraints in C. It is clear that 
each replacement preserves the satisfiability of the CSP instance; moreover, since there are finitely many 
relations in F', each has a pp-definition of constant size, and all of the replacements can be carried out in 
polynomial time. The result is a set of constraints over F and equalities; each equality (u = v) can be 
eliminated by removing it from the set and replacing all instances of (say) v with u. □ 

From this proposition, it can be seen that a finite constraint language F is tractable if and only if all finite 
subsets F' of (F) are tractable: the forward direction follows from the proposition, while the backwards 
direction follows by taking F' = F. This observation can be interpreted as saying that the tractability of a 
constraint language F is characterized by the set (F), and justifies focusing on the sets (F). Interestingly, we 
will show that the set of relations (F) is in turn characterized by a set of operations called the polymorphisms 
of F. 

Definition 3.10. An operation f : D is a polymorphism of a relation R C D'' if for any choice of 

m tuples (ill, • • • , ^ifc); • • • > (^mi, • • • > tmk) f^om R, it holds that the tuple obtained from these m tuples by 
applying f coordinate-wise, (/(tn, . . . , tmi), ■■■ , fihk, ... , tmk)), is in R. 

That is, an operation / is a polymorphism of a relation R if R satisfies a closure property: applying 
/ to any tuples in R yields another tuple inside R. This notion of closure is essentially equivalent to that 
required by the notion of preservation used in Section[Tl however, whereas there we spoke of formulas being 
preserved by operations, here we speak of relations having polymorphisms. We now give some examples, 
which consider the constraint language F3 = {i?o,3) -^1,3) ^2,3,-^3,3} from Example |3.4[ 

We remark that, by making use of the result of Reingold 1541 . the reduction of Proposition |3.9| can be carried out in logarithmic 
space. 
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Example 3.11. The relation i?o,3 has the boolean OR V operation as a polymorphism; we can see this as 
follows. Suppose that (tii, ti2, ^13), (^21, ^22; ^23) are two tuples from i?o,3- There is some coordinate of the 
first tuple equal to 1, that is, there exists j G {1, 2, 3} such that tij = 1. It follows that tij V t2j = 1, and 
thus the tuple (tn V ^215*12 V t22> ^13 V t23) is also contained in i?o,3- 

The relation i?i 3 also has the boolean OR V operation as a polymorphism. Let us take two tuples 
(^115^125^13)5 (^215^225*23) from -Ri,3. If the first tuple (tn, ti25 tis) is equal to (0, 0, 0), the OR of the two 
tuples (til V t2i5 ^12 V t22, ti3 V ^23) is equal to the second tuple, which is contained in R by assumption. 
Otherwise, either ti2 or tis is equal to 1, implying that one of the values (ti2 V t22)5 (^13 V t23) is equal to 1 
and that (tn V t2i5 ^12 V t225 ^13 V t23) is contained in i2i,3. 

The relation i?2,3 does not have the boolean OR V operation as a polymorphism. This is because the 
two tuples (1, 0, 0), (0, 1, 0) are both in i?2,3, but their OR, the tuple (1, 1, 0), is not. □ 

Example 3.12. None of the relations in r3 = {i?o,35 -^1,35 -^2,35 -^3,3} have the majority operation as a poly- 
morphism. Indeed, let i? be any relation of the form {0, l}^\{(ai, 02, 03)} with ai, 025 03 ^ {05!}- Observe 
that the tuples {^ai, 02, 03) (ai, -102, 03) (ai, 025 -■03) are all contained in R, but applying the majority op- 
eration to these tuples yields the tuple (majority(-iai, ai, ai), majority(a25 -■025 02), majority(a3, 03, -■03)) 
which is equal to (oi, 02, 03) and is hence not in R. □ 

Upon initial acquaintance, the notion of polymorphism may appear unrelated to the notion of pp- 
definability. Actually, it turns out that the polymorphisms of a constraint language F contain enough in- 
formation to derive the set of relations (F) ! We say that an operation / : — > L> is a polymorphism of a 
constraint language F if it is a polymorphism of all relations i? G F, and we use Pol(F) to denote the set of 
all polymorphisms of F, that is, 

Pol(F) = {/ : Vi? € F, / is a polymorphism of R}. 

Also, for a set of operations O, we use Inv(O) to denote the set of relations having all operations in O as a 
polymorphism, that is, 

Inv(O) = {R : V/ € O, / is a polymorphism of R}. 
Theorem 3.13. Let T be a finite constraint language over a finite domain D. It holds that (T) = lnv(Pol(F)). 

In words, this theorem states that a relation is pp-definable from F exactly when all polymorphisms of 
F are polymorphisms of it. Again, this result shows that the set of relations (F) can be derived from the set 
of operations Pol(F), in particular, by applying the Inv(-) operator. This theorem was established by Geiger 
and Bodnarchuk et al. ifj^llTTl FI The proof of the ^ direction given here is based on a proof that appeared 
in Dalmau's Ph.D. thesis ||29l . 

Proof. We first show that (F) C lnv(Pol(F)); this is the more straightforward direction. Let 

R{vi, ...,Vk) = 3xi... 3xmC 

be the pp-definition of a relation R over F. Suppose that g : D'^ ^ is a polymorphism of F; we want to 
show that 5 is a polymorphism of R. 

Consider first the relation R' defined by 

R!{vi, ...,Vk,Xi,.. .,Xm) = C. 

Let ti, . . . ,tnhe tuples in R'. Each tuple ti has the form {fi{vi), . . . , fi{vk), fi{xi), . . . , fi{xm)) for an as- 
signment fi satisfying C. Let S{wi, . . . ,wi)he. any constraint or equality of C. We have that {fi{wi), . . . , fi{wi 

We remark that the Pol(-) and Inv(-) operators give rise to an instance of a Galois connection. 
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S for each /j. Since S has as a polymorphism, we have that the arity / tuple with g{fi{wi), . . . , fn{wi)) 
as its ith coordinate is in S. Thus the mapping sending each variable v G {vi, . . . ,Vk,xi, . . . , Xm} to 
g{fi{v), . . . , fn{v)), satisfies all constraints of C. We then have that the the tuple g{ti, . . . , where g is 
applied coordinate-wise, is in R' , and thus that g is a polymorphism of R! . (This is essentially the argument 
of Example 1 1.31 but in slightly different language.) 

Now, we have that R' has g as a polymorphism, and want to show that 

R{vi, . . . jWfc) = 3X1 . . . ^XmR'{vi, ...,Vk,Xi,.. .,Xm) 

has (7 as a polymorphism. Let ti, . . . , t„ be tuples in R. Each tuple tj = {tn, . . . ,tik) has an extension 
t'i = {til, • • • 5 ii(fc+m)) contained in R'. We want to show that the tuple g{ti, . . . , where g is applied 
coordinate-wise, has an extension in R'. The tuple g{t[, . . . , t^) is such an extension. 

We now prove (F) D lnv(Pol(r)). Suppose that R G lnv(Pol(r)). Let n denote the arity of R, let m 
denote the number of tuples in R, and let (in, . . . , tin), ■ ■ ■ , {tmi, ■ ■ ■ , tmn) denote the tuples of R (in any 
order). We may assume that there are no distinct coordinates i, j such that {tn, . . . , tmi) = (^ij, ■ ■ ■ , tmj), 
as these may be eUminated one by one using the following observation: let a{l) a{n — I) denote the 
sequence 1, . . . , n with j removed; then, if 3xi . . . 3xmC is a pp-definition for the relation R' defined by 
-^'(^cr(i)> • • • ) ''^o-(n-i)) = ^Vj{R{vi, . . . , Vn)), wc havc that 3xi . . . 3xm{C A {vi = Vj)) is a pp-definition 
for R{vi,... ,Vn)- 

We create a conjunction of constraints C over the variable set D™. Our conjunction C contains, for 
each relation S G T and sequence of m tuples (sn, . . . , si^), . . . , {smi, • • • , Smfc) G 5*, a constraint 
S{{sii, . . . , Smi), • • • , (sifcj • • • 1 -Smfc)); here, k denotes the arity of S. It is straightforward to verify that 
an assignment / : D"^ — > D satisfies all of the constraints in C if and only if it is an m-ary polymorphism 
of r. 

Now consider the relation R' defined by 

R'iitll, • • • , tmi), • • • , {tin, ■ ■ ■ , tmn)) = . . . 3VpC 

where vi, . . . ,Vp are the tuples in D"^\{{tii, . . . , tmi), ■ ■ ■ , {tin, ■ ■ ■ , tmn)}, in any order. Since it is exactly 
the m-ary polymorphisms of F that satisfy C, we have 

R' = {{f{tii,...,tmi),---,f{tin,---,tmn)) ■ f S Pol (F) ,/ has arity m}. 

We claim that R' = R, which yields the proof. 

R' C R: The tuples (tn, . . . , tin), ■ ■ ■ , {tmi, ■ ■ ■ , tmn) are in R; since every / G Pol(F) is a polymor- 
phism of R, it follows that (/(tn, . . . , tmi), f{tln, tmn)) G R- 

R C R': Let vTj : D"^ — > D denote the function that projects onto the ith coordinate, that is, the 
function such that iTi{di, . . . , dm) = di for all {di, . . . , dm) G D™. Each function vTj is a polymorphism 
of all relations, and is hence a polymorphism of F; it follows that for each i G {1, • • • ,m}, the tuple 

{til, ■ ■ ■ ,tin) = {"^{tu, ■ ■ ■ ,tml), ■ ■ ■ ,T^{tln, ■ ■ ■ ,tmn)) is in R' . □ 

Theorem 13 . 1 3 1 also holds for infinite constraint languages; we leave the proof of this as an exercise. 

Exercise 3.14. Prove Theorem \3.13\ for infinite constraint languages (constraint languages containing in- 
finitely many relations) over finite domains by adapting the given proof for Theorem \3.13\ 

We have observed that the complexity of CSP(F) effectively depends on the set (F), and we just showed 
that the set (F) can be computed from the polymorphisms of F. This suggests that the polymorphisms of a 
constraint language F can be used to derive information on the complexity of CSP(F), which we show as 
follows. 
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Theorem 3.15. Let T and V be finite constraint languages. //'Pol(r) C Pol(r'), then V C (F) and 
CSP(r') reduces to CSP(r). 



This theorem was proved by Jeavons POll . and can be seen as an operational analog of Proposition [ 
Note that this theorem implies that two constraint languages F, V having the same polymorphisms reduce 
to each other, and hence are of the same complexity. 

Proof. The containment Pol(r) C Pol(r') implies the containment lnv(Pol(r)) D lnv(Pol(r')). Invoking 
Theorem 13.131 we obtain that (F) 5 (F'). This implies that F' C (F), and the conclusion follows from 
Proposition 13.91 □ 

Having established that the complexity of a constraint language F is intimately linked to its set of poly- 
morphisms Pol(F), a natural inclination at this point is to inquire as to the structure of the sets Pol(F). In 
fact, each set of operations having this form is an instance of an algebraic object called a clone. 

Definition 3.16. A clone is a set of operations that 

• contains all projections, that is, the operations vr^ : — > D with 1 < i < m such that 
7r™(di, . . . , dm) = difor all di, . . . , dm & D, and 

• is closed under composition, where the composition of an arity n operation f : — s- D and n arity 
m operations fi, ■ ■ ■ , fn '■ D"^ D is defined to be the arity m operation g : D"^ — > D such that 
g{di,. . .,dm) = f{fi{di, . . .,dm), ■ ■ • ,/n(c^i, • • ., dm)) for all di, . . . ,dm ^ D. 

We will say that an operation / (or more generally, a set of operations F) generates an operation g if 
every clone containing / (respectively, F) also contains g. 

Example 3.17. Let / : ^ D be an operation. The operation / generates any operation g : — > D 
obtained by reordering and identifying arguments of /. Formally, let i : {!,... ,n} — > {l,...,m}be 
any mapping. We claim that / generates the operation g such that g{di, . . . , dm) = f{di{i), • • • , c^j(n)) 
for all di, . . . , dm G F). This is because g may be viewed as the composition of / with the projections 

Example 3.18. Let / : — be a binary operation. We show that / generates the operation g{x, y, z) = 
f{x, f(y, z)). By the previous example, / generates the operation f23{x, y, z) = f{y, z). The operation g 
may be viewed as the composition of / with irf and /23. □ 

The computer scientist will recognize that a set of operations F generates any operation g{xi, . . . , Xm) 
that can be represented as an acyclic circuit where the inputs are the variables xi, . . . , Xm and the gates are 
operations from F. 

Proposition 3.19. For all constraint languages T, the set of operations Pol(F) is a clone. 

Proof. Let F be a constraint language, and let ti , . . . , be elements of a relation R from F, 

For any i, we have vr™ {ti, . . . ,tm) =ti which is in R by assumption. Note that we intend that operations 
are applied coordinate-wise to tuples, as in Definition 13. 101 

Now, suppose that / : D and /i , . . . , /„ : D"^ — > L) are polymorphisms of R, and that g is 

the composition of / and /i, . . . , /„. Define s7 = fi{ti, ■ ■ ■ , tm) for alH = 1, . . . , n. Since the fi are 
polymorphisms of R, we have that all tuples are in R. Since / is also a polymorphism of R, it follows 
that the tuple s = /(si, . . . , s^) is contained in R. Clearly, g{ti, . . . , tm) = s, and we conclude that 5 is a 
polymorphism of R. □ 
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We are now ready to state Schaefer's theorem, which describes the complexity of CSP(r) for all boolean 
constraint languages T. Given the connection between the complexity of CSP(r) and the set of polymor- 
phisms of r, it should come as no surprise that there is a description of the complexity of CSP(r) in terms 
of polymorphisms. Not only is this the case, but there is a remarkably clean description based on polymor- 
phisms, stating that a boolean problem CSP(r) is tractable precisely when the constraint language F has 
one of six polymorphisms. 

Here comes the theorem statement. We refer to the unary operation uq : {0, 1} {0, 1} such that 
tio(O) = uo{l) = as the constant operation 0, and similarly to the unary operation ui : {0, 1} {0, 1} 
such that tii(O) = tii(l) = 1 as the constant operation 1. Recall that the operation majority : {0, 1}^ — > 
{0, 1} is defined by majority(x, y, z) = (x A y) V (x A z) V (y A z), and the operation minority : {0, 1}^ — > 
{0, 1} is defined by minority(x, y, z) = x (B y ® z. 

Theorem 3.20. (Schaefer's theorem 061/ - algebraic formulation) Let T be a finite boolean constraint 
language. The problem CSP(r) is polynomial-time tractable ifV has one of the following six operations as 
a polymorphism: 

• the constant operation 0, 

• the constant operation 1, 

• the boolean AND operation A, 

• the boolean OR operation V, 

• the operation majority, 

• the operation minority. 

Otherwise, the problem CSP(r) is NP-complete. 

Schaefer's theorem was originally formulated in terms of properties of relations 1561 : Jeavons [40] 
recognized that the algebraic formulation was possible. Jeavons BOl also pointed out that the algebraic 
formulation gives rise to a polynomial-time test for deciding if a given constraint language satisfies one of 
the six tractability conditions. However, the algebraic formulation not only makes it easy for machines to 
test for tractability, it makes it easy for human beings as well! 

Exercise 3.21. Let T be the constraint language {Ro;i, Rz;i} where i?o,3 and R^-^ are defined as in Exam- 
ple \3.4\ Show, using Theorem I J. 201 that CSP(r) is NP-complete. 

Exercise 3.22. Let F be the constraint language {^^nae}. where Rmae = {0, 1}^ \ {(0, 0, 0), (1, 1, 1)} is 
the relation from Example \3.6\ Show, using Theorem \3. 20\ that CSP(r) is NP-complete. 

Exercise 3.23. Let T be the constraint language {-Runs}, where i?iin3 denotes the ternary relation con- 
taining all tuples with exactly one 1, that is, {(1, 0, 0), (0, 1, 0), (0, 0, 1)}. Show, using Theorem \3.20\ that 
CSP(r) is NP-complete. 

Exercise 3.24. Let Cq be the arity one relation {(0)}, Ci be the arity one relation {(!)}, and S be the 
ternary relation 

{{a,b,c) e {0,lf : (a = 6) V (6 = c)}. 
Let r be the constraint language {Cq, Ci, S}. Show, using Theorem \3. 20\ that CSP(r) is NP-complete. 
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4 Schaefer tractability: the good news, first 



In this section and the next, we establish Schaefer's Theorem 13.201 We begin with the good news in this 
section by establishing that each of the six operations listed in the theorem statement, as polymorphisms, 
guarantee tractabiUty. We consider each of the operations in turn. In each case, we assume that F is a 
finite constraint language having the operation being considered as a polymorphism, and demonstrate that 
CSP(r) is polynomial-time tractable. 

The constant operation 0. In this case, an instance of CSP(r) is satisfiable if and only if for all constraints 
R{vi, . . . ,Vk), the relation R is non-empty. Clearly, this condition is necessary for satisfiability. When the 
condition holds, we claim that the function mapping all variables to satisfies all constraints. This is 
because, in a constraint . . . , f^), if is non-empty, applying the polymorphism to any tuple in R 

yields the all-zero tuple (0, . . . , 0). 

The constant operation 1. The reasoning in this case is identical to the case of the constant operation 0, 
but with the value 1 in place of 0. 

The boolean AND operation A. In this case, we can apply a general inference algorithm for constraint 
satisfaction problems called arc consistency^ Assume that we have an instance of the CSP with variable 
set V and domain D. For a constraint C = R{vi, . . . , Vk), define 7r„^(C) = {ti : {ti, . . . , tk) G R], for all 
variables vi, . . . ,Vk, and TTwiC) = D for all variables w \ {vi, . . . , Vk}- Note that for any solution / 
to the CSP, and any constraint C, and variable v, it must hold that f{v) G ■Ky{C). 

The following is the arc consistency algorithm. The intuition is that, using the sets 7r^,(C), we "tighten" 
the constraints by removing unusable tuples from the relations. If some relation becomes empty, we can 
conclude that the entire problem is unsatisfiable. 

Arc consistency algorithm 
Input: an instance of the CSP. 

1 For each variable v, define to be ncvr„(C) where the intersection is over all constraints C. 

2 For each constraint R{vi, . . . ,Vk), replace R with R n {D^-^ x • • • x D^^). 
If R becomes empty, then terminate and report "unsatisfiable". 

3 If any relations were changed in step 2, goto step 1. Otherwise, halt. 

Let us study this algorithm. First, we show that if it reports "unsatisfiable", it does so correctly. As we 
mentioned, if / is a solution to the input CSP, then for all constraints C and all variables v, it must hold that 
f{v) G vr^(C). This implies that if / is a solution to the CSP, we have f{v) G for all variables v, where 
Dy is the set computed in step 1. It follows that step 2 preserves the set of solutions to the CSP: since any 
solution / obeys f{v) G for all variables v, a constraint R{vi, . . . , v^) is satisfied by / even when R is 
replaced with R n {D^^ x • • • x D^^,). 

Next, we show that if the algorithm halts in step 3, then there exists a solution, assuming that each 
relation originally had the A operation as a polymorphism. Define the sets Dy as in step 1 of the algorithm, 
and define a mapping f : V ^ {0,1} as follows: 

r ifA, = {0} 

f{v) = \ 1 ifZ)„ = {l} 
[ ifDy = {0,1} 

^ It should be noted that the hterature contains many variants and definitions of arc consistency. 
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We claim that / is a satisfying assignment. Let R{vi, . . . , Vk) be any consti^aint. Let s = (si, . . . , Sk) be 
the tuple obtained by applying the A operation to all tuples of R, in any order. That is, let ti, . . . , be the 
tuples of R (in any order), and define s to be 

{{...{{hAh)At^)A...)Atm) 

where the A operator is applied to two tuples coordinate-wise, as in Definition 13.101 Since the rela- 
tion R originally had A as a polymorphism and contained the tuples ti, . . . ,tm, it suffices to show that 
(/(f i), . . . , f{vk)) = (si, . . . , Sk)- Let Vj be one of the variables vi, . . . ,Vk, and for each f = 1, . . . , m, let 
us denote the tuple U by {tn, . . . , tj^). If € D^., then there exists a tuple ti such that tij = 0, and thus 
Sj = 0. Otherwise, D^^ = {1}, and for all tuples tl we have tij = 1, from which it follows that sj = 1. 

The boolean OR operation V. The reasoning in this case is identical to the case of the boolean AND 
operation A, but with the roles of the values and 1 swapped. 

The operation majority. Let be a set of constraints over variable set V. We say that f : W ^ D, for 
W a subset of V, is a partial solution of (p if for every constraint R{vi, . . . , v^) € (/>, there exists a tuple 
{di, . . . ,dk) € R such that f{vi) = di for all Vi € W . We give an inference algorithm in the spirit of the 
arc consistency algorithm given for the boolean AND operation. 

Algorithm for majority polymorphism 
Input: an instance of the CSP with variable set V . 

1 For each non-empty subset W = {wi , . . . ,'Wi] ofV of size I < 3, 
add the constraint D\wi, ... ,wi) to (f>. 

2 For each constraint R{wi, ... ,wi) of cj) with / < 3, compute the set 

R' = {{f{wi), . . . , f{wi)) I / : {wi, . . . , wi} — > D is a partial solution of the instance (p}. 
Then, replace R with R'. 

If R becomes empty, terminate and report "unsatisfiable". 

3 If any relations were changed in step 2, goto step 2 and repeat it. Otherwise, halt. 

First, we observe that each replacement in step 2 preserves the set of solutions. This is because any 
restriction of a solution is a partial solution. It follows that if the algorithm reports "unsatisfiable" in step 2, 
it does so correctly. 

We thus need to show that if the algorithm halts in step 3, then there exists a solution, assuming that 
each relation originally had the majority operation as a polymorphism. (We assume that the CSP has three 
or more variables; if it has two or fewer, it is clear that there exists a solution.) It is straightforward to verify 
that each replacement of step 2 preserves all polymorphisms of the CSP, so when the algorithm halts, the 
resulting CSP has the majority operation as a polymorphism. The following definition is key. 

Definition 4.1. Let n > 0. An instance of the CSP with variable set V has the n-extension property if, given 
any subset W CV of size \ W\ = n and a variable v G V, any partial solution f : W ^ D can be extended 
to a partial solution f : W U {v} D. 

Assume that the algorithm halts in step 3; then, for every pair of variables W = {w, w'} there exists 
a partial solution f : W ^ D. This is because there exists a constraint S{w, w') with W as its variables; 
the relation S is non-empty and so we can define / to be any assignment such that {f{w),f{w')) € S. 
Moreover, (f) has the 2-extension property: suppose that W = {w,w'} is a pair of variables, € y is a 
variable, and / : — > is a partial solution. If v G W , then / itself is the desired extension, so we 
assume that v ^ W. There exists a constraint T{w, w' ,v) in the CSP instance; since / is a partial solution. 
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there exists a tuple (6, b', d) ^ T such that f{w) = b, f{w') = b'. The extension /' : {w, w', v} D of f 
such that f (v) = d is a. partial solution. 

The following theorem shows that, as a consequence of (p having the 2-extension property, it has the 
n-extension property for all n > 3. 

Theorem 4.2. Let (p be an instance o/CSP(r) where T has the majority operation as a polymorphism. If(j) 
has the 2-extension property, then for all n > 3 it holds that (j) has the n-extension property. 

Let ui , . . . , Ura be an ordering of the variables V in cf). We have shown that there is a partial solution on 
any pair of variables, so in particular there is a partial solution /2 : {ui,U2\ — > D. In addition, we showed 
that the 2-extension property holds, so there is a partial solution /s : {ui,U2,U2,} — > D. By the theorem, in 
an iterative manner we can define, for all n > 3, a partial solution fn+i '■ {ui, ■ ■ ■ , Un+i} D extending 
the partial solution /„ : {ui, L>. The mapping fm-.V^Dis then a total solution. Therefore, 

we need only prove the theorem to conclude our discussion of the majority operation. 
Proof. We prove this by induction. Suppose that (p has the n-extension property, with n > 2; we show 
that (j) has the {n + l)-extension property. Let be a set of size n + 1, let u be a variable in V, and let 
f : W ^ Dbe a. partial solution. We want to prove that there is an extension f : W U {v} ^ D of f that 
is a partial solution. If u S W, we can take /' = /, so we assume that v ^ W. Let wi,W2,w-i be distinct 
variables in W, and set Wi = W \ {wi} for i S {1, 2, 3}. Let /i, /2, /s be the restrictions of / to Wi, W2, 
and VF3, respectively. Since (j) has the n-extension property, for each i G {1, 2, 3} there exists an extension 
fl'.WiU {v} Dof fi. Define /' to be the extension of / with f'{v) = majority(/( (u), f^{v), f!i{v)). 

We claim that /' is a partial solution. Let R{vi, . . . ,Vk) be any constraint. Since each fl is a partial 
solution, for each i € {1, 2, 3} there exists a tuple {d\, . . . , d^) S R such that f[{vj) = when /j' is defined 
on Vj. We show that /' is a partial solution via the tuple (majority (d}, d^, df), . . . , majority (d^, d\,d\)). We 
consider each variable Vj , in cases. 

• Case: Vj = wi for some i G {1,2,3}. For the values s G {1,2,3} \ {%}, we have that fs{vj) 
is defined and equal to = f{vj). So, two of the values d^,dj,d^ are equal to f{vj) and thus 
/'(^.■) = M) = majority(4,d2,d3). 

• Case: Vj ^ W \ {■wi,W2,'Ws}- For each i € {1, 2, 3}, it holds that f'{vj) = f{vj) = fi{vj) = and 
thus f'{vj) = majority((ij, d^, 

• Case: vj = v. We have f'{v) = maior\ty{f[{v), f2{v), f^{v)) = majority(dj, d^). 

□ 

We remark that the presentation of this proof is based on the proof of a more general result in 1261 . 

The operation minority. We begin by showing that any constraint can be converted into a conjunction of 
linear equations. Here, by a Unear equation we mean an equation of the form ai ® • • • © a/ = 61 © • • • © 6m 
where each and each bi is either a constant or a variable. 

Theorem 4.3. Let R C {0, l}'^ be a relation having the minority operation as a polymorphism. Then the 
constraint R{xi, . . . , x/^) can be expressed as the conjunction of linear equations. 

Proof. We prove this theorem by induction on the arity k. The theorem is straightforward to prove for arity 
/c = 1, so we assume that k > 2. Define the relation Rq C D^^^ by Rq{x2, • • • , x^) = i?(0, X2, ■ ■ ■ , x^) 
and define the relation Ri C D^~^ by Ri{x2, • • • , Xk) = R{1, X2, ■ ■ ■ , Xk). Let ipo{x2, ■ ■ ■ , ^k) and 
ipi{x2, . . . , Xk) be conjunctions of equations expressing Ro{x2, • • • , x^) and Ri{x2, ■ ■ ■ , x^j, respectively; 
these conjunctions exist by induction. 
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If Rq is empty, we may express R{xi, . . . , rc^) by (xi = 1) A V'l- Similarly, if i?i is empty, we may 
express R{xi, . . . , Xk) by (xi = 0) A ipQ. So, assume that both Rq and Ri are non-empty, fix (c2, . . . , c^) to 
be a tuple in Rq, and fix (c2, . . . , c|) to be a tuple in Ri. Define = (0, C2, . . . , c°) and = (1, C2, . . . , c^,). 
Let b C {0, l}'^ be an arbitrary tuple. Observe that if 6 € i?, then (6 © c*' © c^) E R. Moreover, if 
(6 © cO © ?) G i?, then ((6 © © ?) © © ?) = 6 G i?. Thus, b e Rif and only if (6 © ? © ?) G i?. 
Specializing this to bi = 1, we obtain 

(62, . . . , &fe) G -Ri ^ (62 © © C2, . . . , 6fc © © Cfc) G Rq. 

This implies 

(61, . . . , G <^ (62 © 0% © C261, . . . , 6fc © c°6i © clbi) G i?o. 

Thus, the desired equations for R{xi, . . . ,Xk) can be obtained from the equations iPq{x2, 
stituting x[ with {xi © c°xi © cjxi). 



. . . ,x'f.) by sub- 
□ 



An instance of CSP(r) can thus be reduced to the problem of deciding if a conjunction of linear equa- 
tions has a solution: we simply replace each constraint with the equations that are given by Theorem 14.31 
The resulting conjunction of linear equations can be solved by the well-known Gaussian elimination algo- 
rithm. For the sake of completeness, we give a brief description of a polynomial-time algorithm for solving 
these linear equations. First, we observe that it is possible to simplify linear equations so that in each equa- 
tion, each variable appears at most once: this is because two instances of a variable in an equation can be 
removed together. Also, on each side of an equation constants may be added together so that there is at most 
one constant on each side. Next, we observe that an equation e may be removed in the following way. If it 
does not contain a variable, then just check to see if it is true; if it is, then remove it, otherwise report "unsat- 
isfiable". If it does contain a variable, let us suppose for notation that the variable is oi and the equation is 
ai © • • • © a/ = &i © • • • © bm- The equation is equivalent to the equation ai = 02 © • • • © a; © 61 © • • • © bm, 
which we denote by e'. For each of the equations other than e, replace each instance of oi with the right-hand 
side of e'; then, remove e. Perform this equation removal iteratively. Note that by simplifying equations 
after replacements are performed, we may ensure that the number of variable instances in each equation 
never exceeds 2n, where n is the number of variables. 

Exercise 4.4. Generalize the tractability of the operations A and V by showing that any constraint language 
(over a finite domain) having a semilattice operation as a polymorphism is polynomial-time tractable via the 
given arc consistency algorithm. Recall that a semilattice operation is a binary operation that is associative, 
commutative, and idempotent. This tractability result was first proved in H42\l . 

Exercise 4.5. A near-unanimity operation is an operation f : — > D of arity k > 3 such that for all 
elements d, d' G D, it holds that d = f{d', d,d, . . . ,d) = f{d, d' , d, . . . , d) = ■ ■ ■ = f{d, . . . ,d, d'). In 
words, this means that if all but at most one of the inputs to f "agree", then the output is the agreed value. 
(Observe that majority is the unique near-unanimity operation of arity k = 3 over the domain D = {0, 
Prove, by adapting the argument given for the majority operation, that any constraint language T (over 
a finite domain) having a near-unanimity operation as a polymorphism is polynomial-time tractable. This 
tractability result was demonstrated in / [47]/ . 



5 Schaefer intractability 

In this section, we want to complete our proof of Schaefer's theorem by showing that those constraint 
languages not having one of the identified polymorphisms are NP-hard. We first establish a general result 
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on clones over the boolean domain, showing that any such clone either contains one of the four non-unary 
operations of Schaefer's theorem, or is of a highly restricted form, namely, only has operations that are 
essentially unary. We say that an operation / : D'^ — > D is essentially unary if there exists a coordinate i G 
{1, . . . , A;} and a unary operation g : D ^ D such that /(di, . . . , dk) = g{di) for all values di, . . . ,dk G -D. 

Theorem 5.1. A clone over {0, 1} either contains only essentially unary operations, or contains one of the 
following four operations: 

• the boolean AND operation A, 

• the booelan OR operation V, 

• the operation majority, 

• the operation minority. 

Theorem 15.11 follows from a result proved by Post in 1941 which gave a description of all clones over 
the boolean domain |[53l ; see fV2\ for a nice presentation of this theorem. Those readers familiar with 
Rosenberg's theorem on minimal clones 155)1 will recognize that our proof draws on ideas present in the 
proof of that result, although our proof is specialized to the boolean case. 

When / : ^ D is an operation, we will use / to denote the unary function defined by f{d) = 
f{d,...,d) for all d ^ D. We say that an operation / : ^ D is idempotent if the operation / is the 
identity function. As usual, we will use to denote the unary operation on {0, 1} mapping to 1 and 1 to 0. 
Proof. Suppose that C is a clone containing an operation / : {0, 1}^ {0, 1} that is not essentially unary. 
We will show that / generates one of the four operations given in the statement of the theorem. 

We first consider the case when / is a constant operation. Suppose that / is the constant 0; the case 
where / is the constant 1 is dual. Since / is not essentially unary, there exist elements ai, . . . , € {0, 1} 
such that /(ai, . . . , a^) = 1. Note that since / = we have {oi, . . . , o^} = {0, 1}. Now, we define the 
binary function (/ by 5r(xi,xo) = /(xai, ... ,Xa J. We have ^(0, 0) = g{l,l) =Oandg'(l,0) = 1. We 
consider two cases depending on the value of ^(0, 1). If (7(0, 1) = 1 we have that g is the exclusive OR 
operation, and thus g generates the operation minority: minority(x, y, z) = g{x, g{y, z)). If g{0,l) = 
then it can be verified that x Ay = g{x, g{x, y)). 

We now suppose that / is not a constant operation. It follows that / must be the identity mapping or the 
operation From /, we claim that we can generate an idempotent operation /' that is not essentially unary. 
If / itself is idempotent, we take /' = /; if / is the operation we take f'{xi, . . . , xj.) = /(/(xi, . . . , a;^)). 
We have established that the clone contains an idempotent operation that is not essentially unary, or equiv- 
alently, an idempotent operation that is not a projection. Let g : {0, 1}™ — > {0, 1} be an operation of this 
type having minimal arity. We now break into cases depending on the arity m of g. 

Case m = 2: We cannot have g{0,l) ^ g{l,0), otherwise the operation 5 is a projection. Thus 
g{0, 1) = g{l, 0). If g{0, 1) = g{l, 0) = 0, we have that g is the operation A, and if g{0, 1) = g{l, 0) = 1, 
we have that g is the operation V. 

Case m = 3: By the minimality of the arity of g, if any two arguments of g are identified, we obtain a 
binary operation that is a projection onto either its first or second coordinate. Considering the three possible 
ways of identifying two arguments of g, we obtain the following eight possibilities: 



(1) (2) (3) (4) (5) (6) (7) (8) 



g{x,x,y) 
g{x,y,x) 
9{y,x,x) 



xxxxyyyy 
xxyyxxyy 
xyxyxyxy 
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In case (1), we have g = majority, and in case (8), we have g = minority. The cases (2), (3), and (5) cannot 
occur, since then / is a projection. The remaining cases, (4), (6), and (7), are symmetric; in each of them, g 
generates the operation majority. For example, in case (6), we have majority(x, y, z) = g{x, g{x, y, z), z). 

Case m > 4: We show that this case cannot occur. First suppose that whenever two coordinates of g are 
identified, the result is a projection onto the identified coordinates. Then, we have g{xi,xi,x^, . . . , Xm) = 
xi and g{xi, X2,X3, . . . , x^) = x^. But, this implies that g{0, 0, 1, . . . , 1) is equal to both and 1, a 
contradiction. So, there exist two coordinates such that, when identified, the result is a projection onto 
a different coordinate. Assume for the sake of notation that g{xi,xi,xs,X4, . . . ,Xm) = x^. We claim 
that g{xi,X2,xi,X4^, . . . ,Xm) = 2:4. Let j be such that (^(xi, X2, xi, a;4, x^) = Xj. This implies 
g{xi,xi,xi,Xi, . . . , Xm) = Xj. Observe that 

g{xi,xi,X3,Xi, . . . ,Xm) = X4 implies g{xi, xi, xi, x^, . . .,Xm) = x^^ 

and hence We can similarly show that ^(xi, X2, X2, X4, . . . , Xm) = x^. For any assignment to the 

variables xi, . . . , Xm, one of the equalities x\ = X2, x\ = X3, X2 = X3 must hold, since our domain {0, 1} 
has two elements, and thus 5 is a projection. □ 

Next, we establish a lemma on which all of our intractability results will be based. Let us say that an 
essentially unary operation / : D acts as a permutation if there exists a coordinate i G {1, . . . , /c} 

and a bijective operation it : D ^ D such that /(xi, . . . , Xk) = 7r(xj). 

Lemma 5.2. IfT is a finite boolean constraint language such that Pol(r) contains only essentially unary 
operations that act as permutations, then for any finite boolean constraint language T', it holds that CSP(r') 
reduces to CSP(r). 

Proof. If Pol(r) contains only projections, then Pol(r) C Pol(r') and we can apply Theorem l3.15l 

Otherwise, we define a constraint language T" that contains, for every relation R' G F', a relation R" 
defined by i?" = {(0, ti, . . . , t^) : {ti, ■ ■ ■ M G i?'} U {(1, -ti, . . . , -t^) : G R'}. The 

constraint language V" has the operation as a polymorphism, and hence, by the assumption, all operations 
in Pol(r) as a polymorphism. Thus, by Theorem 13. 151 CSP(r") reduces to CSP(r), and it suffices to show 
that CSP(r') reduces to CSP(r"). 

Given an instance (f)' of CSP(r') having variables V , we create an instance 0" of CSP(r") as follows. 
Introduce a fresh variable 60 ^ V, and for every constraint R'{vi, . . . ,Vk) occurring in our instance we 
create a constraint R"{bQ,vi, . . . ,Vk) in the instance (p". Suppose that / : F — > {0, 1} satisfies (p'; then, 
the extension of / mapping 69 to satisfies (p". Suppose that g : V U {bo} — > {0, 1} satisfies 0"; then, if 
9{bo) = 0, the restriction of g( to F satisfies cf)', and if g{bo) = 1, the mapping / : F — > {0, 1} defined by 
f{v) = ^g{v) for all v ^V, satisfies 0'. □ 

We can now prove Schaefer's Theorem. 
Proof. (Theorem 13.201 ) Let F be a finite boolean constraint language. If F has one of the six operations 
given in the theorem statement as a polymorphism, then, as discussed in Section |4l the problem CSP(F) is 
polynomial-time tractable. Otherwise, since F does not have any of the operations {A, V, majority, minority} 
as a polymorphism, by Theorem 15. 11 the clone Pol(F) contains only essentially unary operations. The only 
unary operations on {0, 1} are the two constant operations and the two permutations (the identity and ^). 
Since Pol(F) does not contain either of the two constant operations, the set Pol(F) contains only essentially 
unary operations that act as permutations, and the result follows from Lemma [5^ by taking F' to be a finite 
boolean constraint language where CSP(F') is known to be NP-hard. For instance, we can take F' to be the 
constraint language identified by Proposition 13.51 □ 

We close the section with some exercises. 
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Exercise 5.3. Using Theorem \5.1\ observe that for each of the constraint languages T in Exercises \3.21\ 
\3.24\ the only polymorphisms ofV are the projections. Then, using Theorem \3.13\ observe that (F) is the set 
of all relations. 

Exercise 5.4. Prove that if T is a finite constraint language ( over a finite domain D ), then there exists a 
finite constraint language V (over a finite domain) such that CSP(r) reduces to and from CSP(r'), and all 
essentially unary polymorphisms of T act as permutations. This result was established in / |20|/ . Hint: let 
f : D ^ D be a unary polymorphism ofV having minimal image size, and consider V = f{T). 

The following generalization of Lemma l5.2l is known ||20]| . 

Lemma 5.5. (follows from 1 20 J ) IfT is a finite constraint language (over a finite domain D) whose essen- 
tially unary polymorphisms all act as permutations, then the problem CSP(r) reduces to and from CSP(rc), 
where Fc = T U {{(d)} : (i E Z?}. That is, Tc is the constraint language obtained by augmenting T with all 
unary relations of size one. 

Exercise 5.6. Derive Lemma \52\ from Lemma UTSl 

Exercise 5.7. Prove Lemma \53\ Hint: construct an instance o/CSP(r) with variable set D whose solutions 
f : D ^ D are the unary polymorphisms of T. 

6 Adding quantification to the mix 

In this section, we consider quantified constraint satisfaction problems. We prove a classification result for 
the boolean domain analogous to Schaefer's theorem. The problems we study are defined as follows. 

Definition 6.1. Let T be a finite constraint language. The problem QCSP(r) is to decide the truth of a 
formula of the form Qivi . . . QnVnC, where 

• each Qi G {V, 3} is a quantifier, 

• each Vi is a variable, and 

• C is the finite conjunction of constraints having relations from T and variables from {vi, . . . ,Vn}- 

The sequence Qivi . . . QnVn i^ called the quantifier prefix of the formula. 

For each constraint language F, the problem QCSP(F) is a generalization of CSP(F): the problem 
CSP(F) can be viewed as the restriction of QCSP(F) to instances having only existential quantifiers. It is 
well-known that each problem QCSP(F) belongs to the complexity class PSPACE; it is also known that 
there exist constraint languages F such that QCSP(F) is PSPACE-hard. 

Proposition 6.2. 0^1/ The problem QCSP(F3), where F3 as is defined in Example \3.4\ is PSPACE-hard. 

This section will prove the following theorem, which classifies every problem of the form QCSP(F) as 
tractable or PSPACE-complete. 

Tlieorem 6.3. Let F be a finite boolean constraint language. The problem QCSP(F) is polynomial-time 
tractable ifV has one of the four operations {A, V, majority, minority} as a polymorphism; otherwise, the 
problem QCSP(F) is PSPACE-complete. 



ACM SIGACT News 



19 



Vol. — No. — 



Theorem l6.3l is a "wide dichotomy theorem": it shows that only the complexity classes P and PSPACE — 
and none of the complexity classes inbetween — are characterized by the boolean problems QCSP(r). Also 
note that, in contrast to Schaefer's theorem, here the constant polymorphisms are not strong enough to 
guarantee tractability of QCSP(r); however, see Exercise [6 . 8 1 below for a related result. 

A full proof of Theorem 16.31 was given by Creignou et al. 1281 . although a partial classification was 
claimed by Schaefer |56|. Related work by Dalmau |301 established the full classification given by this 
theorem, assuming Schaefer's unproved partial classification. 

As for the CSP, we can give a notion of definability for the QCSP that allows a constraint language to 
"simulate" further relations. Whereas for the CSP we considered pp-definability, here we consider a more 
general notion of definability where universal quantification is permitted. 



Definition 6.4. We say that a relation R C is few-definablq^/rom a constraint language T if for some 

m > there exists a finite conjunction C consisting of constraints and equalities (u = v) over variables 
{f 1, . . . ,Vk, xi, . . . , Xm} cind quantifiers Qi, . . . , Qm G {V, 3} such that 



We use [r] to denote the set of all relations that are few -definable from T. 

Note that a pp-definition is a special case of a few-definition, and so a relation R that is pp-definable 
from a constraint language F is also few-definable from F, that is, the containment (F) C [F] holds. 

Proposition 6.5. Let F and T' be finite constraint languages. If T' CI [T], then QCSP(F') reduces to 



Proof. The proof resembles the proof of Proposition 13.91 From an instance <I> = V(/) of QCSP(F'), we 
create an instance of QCSP(F) in the following way. We loop over each constraint C = R{vi, . . . , v^) in 
the original instance, performing the following operations for each: let Qixi . . . QmXmC be a few-definition 
of C over F, rename the quantified variables xi, . . . , Xm if necessary so that they are distinct from all other 
variables in the formula, replace C with C in and add QiXi . . . QmXm to the end of the quantifier prefix 
V. The resulting quantified formula may contain equalities [u = v); we can process such equalities one 
by one as follows. Suppose that an equality {u = v) is present and that u comes before v in the quantifier 
prefix. If V is universally quantified, then the formula is false. Otherwise, remove v from the quantifier 
prefix, and replace all instances of v with u. □ 

We now give a theorem that will be used to establish the needed hardness results in the classification. 

Theorem 6.6. IfT is a finite boolean constraint language such that Pol(F) contains only essentially unary 
operations, then for any finite boolean constraint language Tq, it holds that QCSP(Fo) reduces to QCSP(F). 

Proof. We prove this theorem in a sequence of steps. 

Step 1. We define a constraint language F' that contains, for each relation R G Fq, the relation R' = 
{R X {0, 1}) U {(0, . . . , 0), (1, . . . , 1)}. Observe that F' has both constant polymorphisms. In addition, for 
each relation R G Fq, it holds that R{vi, . . . , Vk) = \/y{R'{vi, . . . ,Vk, y)) and thus R € [F']. Thus, by 
Proposition [631 we have that QCSP(Fo) reduces to QCSP(F'). 

^ Where does the name/ew-definable come from? The LaTeX commands for the symbols V, 3, and A. 




R{vi, ...,Vk) = QlXi . . . QmXmC. 



QCSP(F). 
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step 2. In this step, we show that QCSP(r') reduces to QCSP(r") for a constraint language F" having 
all unary operations (over {0, 1}) as polymorphisms. We proceed as in the proof of Lemma [S!2l For 
each relation R' G F', there is a relation R" G F" defined by R" = {(0,ti, ... ,tk) : {h, ...,tk) € 
R'} U {(1, -iti, . . . , -life) : {ti, . . . ,tk) G R'}. Using the fact that F' has the constant operation as a 
polymorphism, it can be seen that F" has all unary operations as polymorphisms. Given an instance 
of QCSP(F'), we create an instance of QCSP(F") as follows. We introduce a new variable bo, and for 
every constraint R'{vi, . . . , v^) appearing in our instance <!>', we create a constraint R"{bo,vi, . . . , vj^) in 
the instance <I>". The quantifier prefix of ^" is Qbo for either quantifier Q G {V, 3} followed by the quantifier 
prefix of that is, the quantifier prefix of <I>" is obtained by adding, to the quantifier prefix of bo as the 
outermost quantified variable. It is straightforward to show that <!>' is true if and only if <I>" is true. 

Step 3. Since F" has all unary operations as polymorphisms, it has all essentially unary operations as 
polymorphisms, and we have Pol(F") 5 Pol(F). By Theorem 13. 151 we have the containment F" C (F), 
and thus QCSP(F") reduces to QCSP(F) by Proposition [O] □ 

We can now prove the classification theorem. 
Proof. (Theorem l6.3l) If F has one of the given operations as a polymorphism, then the problem QCSP(F) 
is tractable; we refer the reader to the original proofs IH |43l HH |28l and to ||25T| for an algebraic approach. 
If F does not have one of the four given operations as a polymorphism, then by Theorem 15. 1[ the set 
Pol(F) contains only essentially unary operations, and QCSP(F) is PSPACE-hard by Proposition 16.21 and 
Theorem l6.6l □ 

Exercise 6.7. When F is a constraint language, let sPol(F) denote the set containing all surjective polymor- 
phisms ofT. Prove the following result, which was established by Borner et al. fJSS- 

Theorem. Let T be a constraint language over a finite domain D. It holds that [F] = lnv(sPol(F)). 
Hint: to show that a relation R G lnv(sPol(F)) is in [F], define R' to be 

R' = {/(tT, ...,*;;;):/ G Pol(F),t- G X D^} 
which is the smallest relation in (F) containing R x W^, and consider 

Vyi . . .\lyd{R'{vi, ...,Vk,yi,.. .,yd))- 
Here, we use d to denote the size of D. 

Exercise 6.8. Let T be a finite boolean constraint language. Prove that if [F] has one of the constant 
operations 0, 1 as a polymorphism, then QCSP(F) is polynomial-time tractable. 

7 Bounded alternation 

In this section, we study the problems QCSP(F) under restricted prefix classes. What do we mean by a 
prefix class? For each instance of the QCSP, we may take the quantifier prefix, eliminate the variables, and 
then group together consecutive quantifiers that are identical to obtain a pattern. For example, from the 
quantifier prefix 3wi3w2^w^yyiiy2^xi3x2^X'i3xi^ we obtain the pattern 3V3. An instance is said to have 
prefix class Si if its pattern is 3, prefix class FEi if its pattern is V, prefix class S2 if its pattern is 3V, prefix 
class 112 if its pattern is V3, and so forth. The collection of formulas having prefix class or 11^ for some 
A; > 1 are said to be of bounded alternation because in their quantifier prefixes, the number of alternations 
between the two different quantifiers is bounded above by a constant. For all > 1 and finite constraint 
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languages T, we define Sfc-QCSP(r) (nfc-QCSP(r)) to be the restriction of QCSP(r) to instances having 
prefix classes (respectively, 11^). We study the prefix classes where the innermost quantifier is existential, 
that is, the classes for odd k and 11^ for even k. It is known that the corresponding problems characterize 
the complexity classes S^, of the polynomial hierarchy. 

Proposition 7.1. / [57l \60\l Let T be a finite constraint language over a finite domain. For even k > 2, 
the problem n^.-QCSP(r) is in H^, and for odd k > 3, the problem Sfc-QCSP(r) is in S^. In addition, 
letting denote the constraint language from Example \3.4\ for even k > 2, the problem nfc-QCSP(r3) is 
H^-complete, and the problem Sfc-QCSP(r3) is T,^-complete. 

This section proves the following classification theorem. 

Theorem 7.2. Let T be a finite boolean constraint language. For all k > 1, the problems nfc-QCSP(r) and 
Sfc-QCSP(r) are polynomial-time tractable ifV has one of the four operations {A, V, majority, minority} 
as a polymorphism; otherwise, 

• the problem nfc-QCSP(r) is H^-complete for even k>2, and 

• the problem Sfc-QCSP(r) is Ti^-complete for odd k > 3. 

Theorem 17.21 can be taken as a fine version of the last section's classification theorem (Theorem 16.31 ). 
showing that the constraint languages shown to be hard in that theorem are also hard under bounded alter- 
nation. Theorem 17. 2 1 has been previously established by Hemaspaandra |[39l and Bauland et al. 

How can we prove this theorem? The tractable cases clearly follow from those of the last section's 
classification theorem, so we are left with proving the hardness results. What happens if we try to establish 
these hardness results by imitating the proof of the hardness results of the last section (that is, the proof of 
Theorem 16.61 )? We become faced with a difficulty: this proof relied on Proposition 16.51 which showed that 
QCSP(r') reduces to QCSP(r), assuming F' C [r]. However, the given proof of this proposition does not 
preserve the prefix class of the original formula: an instance of QCSP(r') is transformed into an instance 
of QCSP(r) by taking few-definitions for the constraints in $ and appending these few-definitions to the 
end of the quantifier prefix of $ ! To overcome this difficulty, we establish a version of Proposition 16. 5 1 which 
does preserve the prefix class. 

Theorem 7.3. Let T and V be finite constraint languages over the same finite domain. IfT'Q [r], then 

• for even k>2, there is a reduction from nfc-QCSP(r') to nfc-QCSP(r), and 

• for odd k > 3, there is a reduction from Sfc-QCSP(r') to Sy!j-QCSP(r). 

To the best of my knowledge, this theorem is new. Although here we only apply this theorem to the 
case of a boolean domain, I would like to emphasize that the theorem gives a general tool that applies for all 
finite domains, which I believe will be useful for studying the bounded alternation QCSP over domains of 
larger size. 

In order to prove Theorem 17. 31 we establish a lemma that shows that the relations in [T] can be expressed 
in a particular way. 

Definition 7.4. Let R C be a relation ofarity k over a finite domain D of size d = \D\. Say that R is 
spread-expressed by R' C 1)^'+'^ if the following two properties hold: 

• (monotonicity) ifbi, . . . ,bd,bi, . . . ,b'^ G D are elements such that {bi, . . . , bd} 5 {b'l, . . . , b'^}, then 
for all (ai, ...,ak)e -D^ R'{ai, . . . ,Ofe, . . . , 6^) implies R'{ai, . . .,ak,b[, . . . , 6'^). 
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• (expression) if bi, . . . ,bd G D are elements such that {61, . . . , bd} = D, then for all (ai, . . . , ak) G 
D^, R{ai, . . . , ak) if and only if R'{ai, . . . , a^, 61, ... , bd). 

Lemma 7.5. Let T be a constraint language over a finite domain D. For every relation R € [F], there exists 
a relation R' S (F) such that R is spread-expressed by R'. 

Proof. We let d denote the size of D and k denote the arity of R. If R is pp-definable from T without 
the use of quantifiers, then let . . . , v^) denote such a definition of R{vi, . . . , Vk)- We define R' by 
R'{vi, ...,vk,yi,...,yd) = 4'{vi, . . .,Vk). 

Now suppose that the lemma is true for a relation R. We want to show that it is true for a relation obtained 
from R by "quantifying away" a coordinate of R. Suppose that R2{xi, . . . , Xk-i) = QxkR{xi, . . . , Xk), 
with Q G {V, 3}. We consider two cases depending on the quantifier Q. In both cases, we use R' to denote 
a relation such that R is spread-expressed by R'. 

Case Q = 3: It is straightforward to verify that R2 is spread-expressed by the relation R2 defined by 

R2{xi, ■ ■ ■ ,xk-i,yi, ...,yd) = 3xkR'{xi, . . . ,xk,yi, ■ ■ ■ ,yd)- 

Case Q = V: We claim that R2 is spread-expressed by the relation R2 defined by 

d 

i?2(xi, . . .,xk-i,yi,.. .,yd) = /\ R'{xi, . . .,xk-i,yi,yi, ■ ■ .,yd)- 

1=1 

We verify this as follows. 

First, we verify monotonicity. Suppose that bi, . . . ,bd,bi, . . . ,b'^ G D are elements such that the con- 
tainment {61, . . . , bd} 5 {b[, . . . , b'^} holds. Let (ai, . . . , a^^i) € D*^^^ be a tuple, and suppose that 
i?2(ai, • • ■ , CLk-i, bi, . . . ,bd) holds. Then, by the definition of R2, we have 

/\ R'{ai,...,ak-i,b,bi,...,bd), implying f\ R'{ai, . . . ,ak-i,b,bi, . . . ,bd). 
6G{fei,...,fed} be{b[,...,b'J 

By the monotonicity of R', this in turn implies that A;)e{6'j b'^} R'{o-i, - ■ ■ , a^-i, b,b'i, . . . , b'^) holds, which 
is equivalent to R'2{ai, • • • , «a,-i, • • • , b'^). 

Next, suppose that 61, . . . , 6^ G D are elements such that {61, . . . , bd} = D, and let (oi, . . . , ak-i) G 
D^^^ be a tuple. We have 

i?2(ai, . . . ,afc_i) 44> 
VfeG-D(i?(ai,...,Ofc_i,6))^ 
V6 G D{R'{ai, Ofc_i, 6, 61, ... , 6^)) <^ 

i?2(ai) • • • > «A:-1; ^1; ■ ■ ■ ,bd)- 

□ 

We now give the proof of Theorem 17. 3 1 
Proof. (Theorem 17.31 ) For each relation i?' G F', there exists a relation R" G (F) such that R' is spread- 
expressed by R". Let <!>' be an instance of nfc-QCSP(F') or Efc-QCSP(F') (with k as described in the state- 
ment of the theorem). Denote the quantifier prefix of <!>' by V' and the conjunction of constraints of by (j)', 
so that = Vc/}'. We create an instance ^" of nfc-QCSP(F) or Sfc-QCSP(F) as follows. First, define V" 
to be a quantifier prefix obtained from V' by introducing new universally quantified variables yi, . . . ,yd and 
placing them next to any group of universally quantified variables in V, so that the prefix class is preserved. 
Next, define (p" to be the conjunction of constraints containing a constraint R"{vi, . . . ,yk,yi, ■ ■ ■ , yd) for 
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every constraint R'{vi, . . . ,Vk) in (/>'. Tiie output of the reduction is the formula = Vcf)", but where 
every constraint is replaced with a pp-formula over F, as in the proof of Proposition 16.5 1 

To verify the correctness of the reduction, we need to show that = V'cl)' is true if and only if = 
V"cl)" is true. This follows from the following cycle of implications. Fix : {yi, . . . , yd} — > -D to be a 
surjective mapping. 

V'(P' VNyi . . . Mydct)" V"cp" ^ Vyi . . . 'iydV'f V'^" is true under go V'cj)' 

□ 

We can now prove the classification theorem. 
Proof. (Theorem l7.2l ) Following the discussion earlier in this section, we need to show that, when P = 11^ 
for an even A; > 2 or P = for an odd A; > 3, the problem P-QCSP(r) is hard for the corresponding 
class of the polynomial hierarchy, assuming that F does not have one of the four given polymorphisms. 
By Theorem 15.11 all polymorphisms of F are essentially unary operations. Let Fq be the finite boolean 
constraint language of Proposition 17. 1[ so that P-QCSP(Fo) is hard for the complexity class for which we 
want to prove hardness. 

We now follow the proof of Theorem 16.61 The main modification we need to make is in step 1. Step 1 
defines a constraint language F' having both constant polymorphisms such that Fq ^ [F'] . Here, we appeal 
to Theorem 1731 ( instead of Proposition [631) to obtain a reduction from P-QCSP(Fo) to P-QCSP(F'). Step 
2 can be carried out in a way that preserves the prefix class of the formula, since the new variable bo can be 
quantified using either quantifier (as noted); this gives us a reduction from P-QCSP(F') to P-QCSP(F") 
for a constraint language F" having all unary operations as polymorphisms. Finally, in step 3, we have 
Pol(F") 5 Pol(F) and hence F" C (F), that is, every relation in F" has a pp-definition in F. Since the 
innermost quantifier of P is existential, it follows that P-QCSP(F") can be reduced to P-QCSP(F). □ 

Exercise 7.6. Describe the complexity of all finite constraint languages in the problems Uk-QCSP (T) for 
all odd k > 1, and the problems Sfc-QCSP(F) /or all even k > 2. 

Exercise 7.7. Give an alternative, algebraic proof of Lemma [Z5l by considering the hint of Exercise \6.7\ 

8 To infinity... and beyond? 

In recent joint work, Manuel Bodirsky and I considered the quantified CSP over domains of infinite size (E]. 
One class of constraint languages that we have considered is the class of equality constraint languages. We 
say that a constraint language over domain D is an equality constraint language if each relation is equality 
definable, by which we mean definable using equalities {u = v) and the usual boolean connectives A, 
and V. As an example, consider the ternary relation R C defined by 

R{x, y, z) = {-^{x = y)) \J {y = z). 

Clearly, the disequality relation ^ is another example of an equality definable relation: 

(x / y) = (^(x = y)). 

As an intuition pump, consider CSP({7^}), the constraint satisfaction problem over the disequality relation. 
The problem CSP({7^}) is the |P>|-colorability problem: given a set of pairs of variables, decide if the 
variables can be colored with elements from D such that each pair has different colors. Over n finite domain, 
the problem CSP({7^}) is of course known to be NP-complete for \D\ > 3. On the other hand, over an 
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infinite domain, the problem CSP({7^}) is trivial: if an instance contains a constraint of the form v ^ v,it 
is not satisfiable, otherwise it is satisfiable by the assignment sending all variables to different values ! 

Within the class of equality constraint languages, we were able to establish a complexity upper bound. 
Let us define a positive constraint language to be an equality constraint language where each relation is 
definable using equalities (n = v) and the positive boolean connectives A and V. As an example, take the 
relation S C defined by 

S{w, X, y, z) = {{w = x) A{x = y)) V {y = z). 

It can be verified that a positive constraint language has all unary operations as polymorphisms. 

For a domain D of any size, the CSP over a positive constraint language is trivial; every instance is sat- 
isfiable by the assignment sending all variables to the same value. In contrast, the QCSP is more interesting. 
We have shown that there exist positive constraint languages F over an infinite domain such that QCSP(r) 
is NP-hard Q ; in fact, the constraint language containing the single relation S just given is an example of 
such a language. This negative result is complemented by the following complexity upper bound. 

Theorem 8.1. [8] Let T be a positive constraint language over an infinite domain, and let 

^ = QiVi . . . QnVn<P 

be an instance o/QCSP(r). The formula <I> is true if and only if the formula 

is true. Hence, QCSP(r) reduces to CSP(r U {^}), and is in NP. 

I would like to highlight a facet of our proof of Theorem 18.11 this proof only uses the fact that the 
quantifier-free part (/> of a QCSP(r) instance <I> has all unary operations as polymorphisms. We can thus use 
the same proof to establish an NP upper bound on the more general class of quantified formulas where the 
quantifier-free part is an arbitrary positive formula! 

Theorem 8.2. (follows from [8]) Let 

$ = QiVi . . . QnVn<i) 

be a formula where (/) is composed from equalities (u = v) and the connectives A, V, and the variables 
are interpreted over an infinite domain. The formula <I> is true if and only if the formula defined in the 
statement of Theorem 18. 7 1 is true. Hence, deciding formulas <I> of the described form is in NP. 

It should be pointed out that the NP upper bound established by Theorems 18.11 and 18.21 can be derived 
from a result of Kozen B6l . However, I believe that the direct reduction to an existentially quantified 
formula given here constitutes a particularly transparent explanation for the inclusion in NP. Again, as with 
the results discussed in the first section of this article, these results are not tied to the particular syntactic 
form of the quantifier-free part (f), but only use the fact that possesses certain polymorphisms. Indeed, our 
proof demonstrates that Theorem 18.21 holds even if the quantifier-free part (f) is given succinctly, say, as a 
circuit; this extension cannot, to the best of my knowledge, be readily derived from the proof of Kozen [46 1. 

What I personally find most interesting here is that we are able to obtain a positive complexity result 
(Theorem 18.21 ) using the notion of polymorphism, for a class of formulas that falls beyond the CSP frame- 
work, that is, does not require a conjunction of constraints. This is a twist of high intrigue: the concept of 
polymorphism has recently come into close focus for its relevance to the specific class of CSP formulas, 
but has now been brought to shed light on a more general class of logical formulas (namely, the positive 
formulas of Theorem 18.21 ). I would like to suggest the search for further applications of polymorphisms to 
general logical formulas. 
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Further reading. For the reader interested in further studying the topics of this article, I offer some point- 
ers. I focus on the topics directly addressed by this article, such as algebraic methods for studying the 
problems CSP(r). However, even regarding these topics, the selection and discussion of references here 
is not at all meant to be comprehensive, but rather is a sampling of the literature that reflects my personal 
interests and biases. 

As we have discussed, Schaefer ll56l was the first to systematically consider the family of problems 
CSP(r); he studied constraint languages T over a two-element domain. While there does not appear to 
have been work on these problems in the 80's, the early 90's saw papers of Hell and Nesetril [38 ] and Feder 
and Vardi 13411351 on these problems. Feder and Vardi |[34l[35ll conjectured that every problem of the form 
CSP(r) for r over a finite domain is either in P or NP-complete; this has become known as the Feder- Vardi 
dichotomy conjecture. The algebraic, polymorphism-based approach to studying the problems CSP(r) was 
introduced in the papers of Jeavons, Cohen, and Gyssens [42 1 and Jeavons HOl : the given references are for 
journal papers which appeared in the late 90's. Other foundational work on this algebraic approach appears 
in the journal paper by Bulatov, Jeavons, and Krokhin (TG]. 

Two major complexity classification results on CSP(r) were achieved by Bulatov: the classification 
of all constraint languages T over a three-element domain ||2ll . and the classification of all conservative 
constraint languages T, defined to be constraint languages containing all unary relations ifTSl . There is by 
now a rich literature on CSP(r) tractability and complexity, including the papers fln[33l[T9llT6ll45l[T7l[3Tl 

[miBiiiiiiii. 

Placing CSP(r) problems in complexity classes "below" P has also been studied. AUender et al. [HI gave 
a refinement of Schaefer's theorem showing each problem CSP(r), where F is a constraint language over 
a two-element domain, to be complete for a known complexity class under AC'^ reductions. See |[32l l3ll49l 
for examples of other papers along these lines. 

A framework for studying the CSP over infinite domains was proposed by Bodirsky [13; see [IHIHllOl for 
subsequent work. The quantified CSP has been studied algebraically in papers including lfT4l l4ll8l l25ll24l . 
For references on relevant algebra, we mention Il59ll52ll23l . Other surveys/overviews on these and related 
topics include [|28l|48l|47l|l2l|l3l|27l. 

Acknowledgements. I extend special thanks to Manuel Bodirsky and Victor Dalmau for many enjoyable 
conversations on the topics of this article. I am also indebted to Andrei Bulatov, Neil Immerman, Benoit 
Larose, Riccardo Pucella, Pascal Tesson, and Matt Valeriote for their numerous helpful comments and 
encouragement. 
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